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Reading Recovery® 

are to promote literacy skills and reduce the number of first-grade 
students who are struggling to read. The program supplements 
classroom teaching with one-on-one tutoring sessions, generally 
conducted as pull-out sessions during the school day. Tutoring, 
which is conducted by trained Reading Recovery ® teachers, takes 
place daily for 30 minutes over 12-20 weeks. 



Program description Reading Recovery ® is a short-term tutoring intervention program 

intended to serve the lowest achieving (bottom 20%) first-grade stu- 
dents. Students are chosen for Reading Recovery® by school staff, 
and selection is based on prior reading achievement, diagnostic 
testing (the Clay Observation Survey of Early Literacy Achievement), 
and teacher recommendations. The goals of Reading Recovery® 



Research 



Four studies of Reading Recovery® met the What Works 
Clearinghouse (WWC) evidence standards, and one study met 
WWC evidence standards with reservations. These five studies 
included about 700 first-grade students attending elementary 
schools in diverse settings across the United States. All studies 



focused on low-achieving students who received the Reading 
Recovery® intervention in first grade. Generally, outcomes at 
the end of first grade were used by the WWC to calculate a 
rating of effectiveness. 1 In one study, longer range effects were 
included. 2 



Effectiveness 



Reading Recovery® was found to have positive effects on students’ alphabetics skills and general reading achievement outcomes. 
The program was found to have potentially positive effects on comprehension and fluency. 
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1. The evidence presented in this report is based on available research. Findings and conclusions may change as new research becomes available. 

2. Additional findings on outcomes measured at later time points are shown in Appendix A4.4. 

3. These numbers show the average and range of improvement indices for all findings across the studies. 
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Developer and contact 

Developed by Dr. Marie M. Clay, University of Auckland, New 
Zealand. Distributed through the Reading Recovery ® Council 
of North America (RRCNA). Address: 400 West Wilson Bridge 
Road, Suite 250, Worthington, OH 43085-5218. Email: jjohnson@ 
readingrecovery.org. Web: http://www.readingrecovery.org. 
Telephone: (614)310-7323. 

Scope of use 

Reading Recovery ® was developed in the mid-1970s by Dr. Clay, 
who first tested the program in New Zealand. According to the 
RRCNA, more than 1.5 million first graders in 48 states and the 
Department of Defense Dependents Schools have been served 
in the United States since Reading Recovery ® was introduced in 
1984. Reading Recovery® is also used in New Zealand, Australia, 
Canada, and the United Kingdom. 

Teaching 

Lessons incorporate principles of early intervention and individual 
tutoring and include attention to phonological awareness, pho- 
nics/decoding skills, vocabulary/word work, fluency, and com- 
prehension. Each Reading Recovery ® lesson consists of reading 
familiar and novel stories, manipulating letters and words, and 
writing and assembling stories. Lessons are interactive between 
teacher and student, with the teacher carefully monitoring each 
child’s reading behavior. Reading Recovery ® lessons are dis- 
continued when children demonstrate the ability to consistently 
read at the average level for their grade— between weeks 12 and 
20 of the program. Those who make progress but do not reach 



average classroom performance after 20 weeks are referred for 
further evaluation and a plan for future action. Teacher training 
includes a one-year, university-based training program. 

Cost 

Reading Recovery® is available on a nonprofit, no royalty basis. 
Costs for the program involve start-up costs and ongoing costs. To 
establish a Reading Recovery® site— a district or group of districts 
representing multiple schools— a teacher leader must first be 
trained. This start-up cost includes paying salary, paying university 
tuition for the Reading Recovery ® coursework, and covering the 
costs of books and materials. Sites must also build a one-way mir- 
ror and sound system to monitor training for the teachers. In addi- 
tion to salary, travel, and program support costs for the teacher 
leader, costs for teachers include paying salaries and benefits for 
the time they dedicate to Reading Recovery ® and paying tuition for 
training. Books and materials for lessons and evaluation as well as 
ongoing professional development for both teacher leaders and 
teachers should also be figured into the costs. 

In addition to the teacher training described above, in 2006 
the cost of program materials was approximately $100 per 
student served (calculated by the RRCNA as an average over 
the past five years, 2002-06). Sites pay an annual data evalu- 
ation fee of $250 a site plus $3.50 per student served. Sites 
implementing the program also pay annual technical support 
fees, which vary by the university that provides the Reading 
Recovery® training. Because of the cost and staff needed for 
the intervention, a typical school with one Reading Recovery® 
teacher will serve 4 or 5 students a semester. 



The WWC reviewed 78 studies that examined the effects of Read- standards. One study (Iverson & Tunmer, 1993) met WWC stan- 

ing Recovery®. Four studies (Baenen, Bernhole, Dulaney, & Banks, dards with reservations. The remaining 73 studies did not meet 

1997; Pinnell, DeFord, & Lyons, 1988; Pinnell, Lyons, DeFord, WWC evidence screens. 4 

Bryk, & Seltzer, 1994; and Schwartz, 2005) met WWC evidence 

4. Because Reading Recovery ® is designed to improve the reading skills of low-achieving first-grade readers, the appropriate comparison groups for 
determining the intervention’s effectiveness are similar low-achieving first-grade readers who did not receive Reading Recovery®. Many of the studies 
screened did not meet evidence standards because they used inappropriate comparison groups, such as higher achieving first-grade readers, to draw 
conclusions about the effectiveness of the program. 
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Met evidence standards 

Baenen et al. (1997) was a randomized controlled trial that 
focused on first-grade students from Wake County, North 
Carolina. The WWC review focuses on the outcomes of students 
who qualified for and were randomly assigned to either the 
Reading Recovery ® intervention or a comparison group. From 
an original sample size of 168, outcomes were assessed at three 
time points: end of first grade (n = 147), end of second grade 
(n = 147), and end of third grade (n = 127). Although the WWC 
used only the results at the end of first grade to determine the 
intervention rating, information on the additional findings can be 
found in Appendix A4.4. 

Pinnell, DeFord, and Lyons (1988) was a randomized 
controlled trial. The study sample was first-grade students 
distributed across 14 schools in Columbus, Ohio. Two groups 
were formed by randomly assigning students to an intervention 
group, which received Reading Recovery® in addition to their 
regular classroom instruction (n = 38), or to a control group, 
which received an alternate compensatory program (n = 53). This 
comparison met WWC evidence standards. 5 

Pinnell et al. (1994) was a randomized controlled trial that 
randomly assigned 10 low-achieving first-grade students in each 



of 10 Ohio schools. The WWC review focuses only on the eight 
schools that successfully implemented randomization for the 
intervention (n = 31) and comparison (n = 48) conditions. 6 

Schwartz (2005) was a randomized controlled trial of first- 
grade students from 14 states. The WWC focused on the 37 
students across several schools who were randomly assigned 
to receive the intervention during the first half of the year. The 
other 37 students, who were randomly assigned to receive the 
intervention during the second half of the year, served as the 
comparison group during the first half of the year. 7 The groups 
were compared at mid-year, before the comparison group had 
begun receiving Reading Recovery®. 

Met evidence standards with reservations 

Iverson and Tunmer (1993) was a quasi-experimental design 
study that included first-grade students from 30 school districts 
in Rhode Island. The study compared outcomes for students 
participating in Reading Recovery ® (n = 32) with students in 
a comparison group who did not receive Reading Recovery® 

(n = 32), who were matched on the basis of pretest scores. 8 The 
comparison group received standard small group, out-of-class 
support services. 



5. A third group of students qualified for and received Reading Recovery ® outside of regular classroom instruction, but were also taught by a Reading 
Recovery®- trained teacher when they were in their regular classroom (n = 96). Although this comparison met evidence standards with reservations, it 
was not considered in the intervention rating because it went beyond the standard delivery of the program. However, results are reported in Appendices 
A4.1-A4.3. 

6. Although the original study included analyses of additional interventions implemented at additional schools, only the schools that randomly assigned 
students to Reading Recovery ® or the comparison group were relevant to this review. For more details about the original study, see Appendix A1.3. 

7. Assessments were also made at the end of the year, but they were not appropriate for the WVVC’s analysis because by then both groups of low-achiev- 
ing students had received the intervention. Additional comparison groups of low-average and high-average readers were not used by the WWC because 
these students were not eligible for Reading Recovery ®. 

8. The study also included a third group of students (n = 32) who used a modified version of Reading Recovery ®, which provided explicit instruction in let- 
ter-phoneme patterns instead of the letter identification segment. This group was also compared with the comparison group. Although this comparison 
met evidence standards with reservations, it was not considered in the intervention rating because it went beyond the standard delivery of the program. 
However, results are reported in Appendices A4.1 and A4.3. 
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Findings 

The WWC review of interventions for beginning reading 
addresses student outcomes in four domains: alphabetics, read- 
ing fluency, comprehension, and general reading achievement. 9 
Reading Recovery ® studies included in this report cover all four 
domains and most of the constructs within each domain. The 
findings below present the authors’ estimates and WWC-calcu- 
lated estimates of the size and the statistical significance of the 
effects of Reading Recovery ® on students. The results are pre- 
sented by domain and construct for all the Reading Recovery® 
studies that the WWC reviewed. 

For the four beginning reading domains, subtests of the Clay 
Observation Survey were used in some of the studies. The Clay 
Observation Survey was developed by Dr. Marie Clay, who also 
developed Reading Recovery®. 

Alphabetics. Two studies examined the effects of Reading 
Recovery® on the phonemic awareness construct. Schwartz 
(2005) reported no statistically significant effects for the pho- 
nemic awareness measures— the deletion task and the Yopp- 
Singer Phoneme Segmentation Test— but the effects on both 
measures were positive and considered substantively important 
based on the WWC criteria (that is, at least 0.25). Iverson and 
Tunmer (1993) reported, and the WWC confirmed, statistically 
significant positive effects of the Reading Recovery® intervention 
on two phonemic awareness measures— a phoneme deletion 
task and the Yopp-Singer Phoneme Segmentation Test. 

Three studies examined the effects of Reading Recovery® on 
the print awareness construct in the alphabetics domain. Pinnell, 
DeFord, and Lyons (1988) reported, and the WWC confirmed, 
a statistically significantly positive effect of Reading Recovery® 
on the Concepts about Print subtest of the Observation Survey 
of Early Literacy Achievement. Schwartz (2005) reported, and 
the WWC confirmed, a statistically significant positive effect 
of Reading Recovery® on the Concepts about Print subtest of 



the Observation Survey. Iverson and Tunmer (1993) found a 
statistically significant positive effect of Reading Recovery® on 
the Concepts about Print subtest of the Observation Survey. The 
significance of the effect was confirmed by the WWC. 

Three studies examined the effects of Reading Recovery® 
on the letter knowledge construct in the alphabetics domain. 
Pinnell, DeFord, and Lyons (1988) did not find a statistically sig- 
nificant effect for Reading Recovery® on the Letter Identification 
subtest of the Observation Survey. Schwartz (2005) reported 
a statistically significant positive effect of Reading Recovery® 
on the Letter Identification subtest of the Observation Survey, 
but according to WWC criteria this effect was not statistically 
significant or large enough to be considered substantively 
important. 10 Iverson and Tunmer (1993) found, and the WWC 
confirmed, a statistically significant positive effect of Reading 
Recovery® on the Letter Identification subtest of the Observa- 
tion Survey. 

Three studies examined the effects of Reading Recovery® 
on the phonics construct of the alphabetics domain. Pinnell, 
DeFord, and Lyons (1988) found a statistically significant positive 
effect on the Word Recognition subtest of the Observation Sur- 
vey. In WWC calculations, there was no statistically significant 
effect, but the positive effect was large enough to be considered 
substantively important. Schwartz (2005) found, and the WWC 
confirmed, a statistically significant positive effect of Reading 
Recovery® on the Word Recognition subtest of the Observation 
Survey. Iverson and Tunmer (1993) found statistically significant 
positive effects of Reading Recovery® on the Dolch Word Rec- 
ognition Test, the Word Recognition subtest of the Observation 
Survey, and a pseudoword decoding task. The significance of 
the effects was confirmed by the WWC. 

Overall, in the alphabetics domain, two studies with strong 
designs met WWC evidence standards and demonstrated 
statistically significant positive effects. One additional study met 



9. For definitions of the domains, see the Beginning Reading Protocol . 

10. In this case, the author did not control for pretest differences between groups; however, the WWC did account for pretest differences. 
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WWC evidence standards with reservations and showed statisti- 
cally significant positive effects. 

Fluency. Schwartz (2005) found, and the WWC confirmed, 
positive and statistically significant effects of Reading Recovery ® 
on the Slosson Oral Reading Test-Revised and the Text Reading 
Level subtest of the Observation Survey. 

In the fluency domain, there was one study with a strong 
design that demonstrated statistically significant positive effects. 

Comprehension. Two studies examined the effects of Reading 
Recovery ® on the reading comprehension construct. Pinnell, 
DeFord, and Lyons (1988) found a positive and statistically 
significant effect of Reading Recovery ® on the Reading Com- 
prehension subtest of the Comprehensive Test of Basic Skills 
(CTBS). The significance of the effect was confirmed by the 
WWC. Schwartz (2005) reported no statistically significant effect 
of Reading Recovery ® on the Degrees of Reading Power Test. 

One study examined the effect of Reading Recovery ® on the 
vocabulary construct of the comprehension domain. Pinnell, 
DeFord, and Lyons (1988) found, and the WWC confirmed, a 
positive and statistically significant effect of Reading Recovery® 
on the Reading Vocabulary subtest of the CTBS. 

In the comprehension domain, there were two studies with 
strong designs. One study showed statistically significant posi- 
tive effects, and the other study showed an indeterminate effect. 

General reading achievement. Baenen et al. (1997) did not find 
a statistically significant effect of Reading Recovery ® on grade 
retention. Pinnell, DeFord, and Lyons (1988) found, and the WWC 
confirmed, positive and statistically significant effects of Reading 



Recovery ® on two subtests of the Observation Survey: Hearing 
and Recording Sounds in Words (Dictation) and Writing Vocabu- 
lary. Pinnell et al. (1994) found statistically significant positive 
effects of Reading Recovery ® on the Gates-MacGinitie, the 
Dictation subtest of the Observation Survey, and the Woodcock 
Reading Mastery Test-Revised. The statistical significance of the 
effects was confirmed by the WWC. Schwartz (2005) found, and 
the WWC confirmed, positive and statistically significant effects 
of Reading Recovery ® on two subtests of the Observation Sur- 
vey: Dictation and Writing Vocabulary. Iverson and Tunmer (1993) 
found, and the WWC confirmed, statistically significant positive 
effects of Reading Recovery ® on two subtests of the Observa- 
tion Survey: Dictation and Writing Vocabulary. 

In the general reading achievement domain, there were three 
studies with strong designs and statistically significant positive 
effects. One study had a strong design with indeterminate effects. 
One additional study met WWC evidence standards with reserva- 
tions and demonstrated statistically significant positive effects. 

Rating of effectiveness 

The WWC rates the effects of an intervention in a given outcome 
domain as positive, potentially positive, mixed, no discernible 
effects, potentially negative, or negative. The rating of effective- 
ness takes into account four factors: the quality of the research 
design, the statistical significance of the findings, 11 the size of 
the difference between participants in the intervention and the 
comparison conditions, and the consistency in findings across 
studies (see the WWC Intervention Rating Scheme) . 



11. The level of statistical significance was reported by the study authors or, where necessary, calculated by the WWC to correct for clustering within 

classrooms or schools and for multiple comparisons. For an explanation, see the WWC Tutorial on Mismatch . See Technical Details of WWC-Conducted 
Computations for the formulas the WWC used to calculate the statistical significance. In this report, such adjustments were made for Pinnell, DeFord, 
and Lyons (1988); Iverson and Tunmer (1993); and Pinnell et al. (1994). 
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Improvement index 

The WWC computes an improvement index for each individual 
finding. In addition, within each outcome domain, the WWC 
computes an average improvement index for each study and 
an average improvement index across studies (see Technical 
Details of WWC-Conducted Computations) . The improvement 
index represents the difference between the percentile rank 
of the average student in the intervention condition versus 
the percentile rank of the average student in the comparison 
condition. Unlike the rating of effectiveness, the improvement 
index is entirely based on the size of the effect, regardless of 
the statistical significance of the effect, the study design, or the 
analyses. The improvement index can take on values between 
-50 and +50, with positive numbers denoting results favorable to 
the intervention group. 

The average improvement index for alphabetics is +34 
percentile points across three studies, with a range of -10 to +50 
percentile points. For fluency, the average improvement index 



is +46 percentile points, with a range of +32 to +49 percentile 
points across outcomes in one study. For comprehension, the 
average improvement index is +14 percentile points across two 
studies, with a range of +6 to +21 percentile points. For the 
general reading domain, the average improvement index was 
+32 percentile points across five studies, with a range of -5 to 
+50 percentile points. 

Summary 

The WWC reviewed 78 studies that investigated the effects of 
Reading Recovery®. Four studies met WWC evidence standards 
and one met WWC evidence standards with reservations. Based 
on these five studies, the WWC found Reading Recovery® to 
have positive effects in the alphabetics and general reading 
achievement domains. Reading Recovery ® was found to have 
potentially positive effects in the fluency and comprehension 
domains. The evidence presented in this report may change as 
new research emerges. 
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12. The sample is not appropriate to this review: the parameters for this WWC review specified that students should be in grades kindergarten through 3; 
this study did not disaggregate students in the eligible range from those outside the range. 

13. Does not use a strong causal design: this study was a quasi-experimental design but did not use achievement pretests to establish that the comparison 
group was equivalent to the intervention group at baseline. 

14. Incomparable groups: this study was a quasi-experimental design that used achievement pretests but it did not establish that the comparison group 
was comparable to the treatment group prior to the start of the intervention. 

15. Does not use a strong causal design: this study did not use a comparison group. 

16. Does not use a strong causal design: in this study, which used a quasi-experimental design, data were presented only for students who successfully 
completed the intervention, which does not provide a direct test of the intervention as a whole. 

17. Does not use a strong causal design: this study, which used a quasi-experimental design, had a confounding factor. The Reading Recovery intervention 
was used without proper Reading Recovery materials, and the instructors had not been fully trained. This makes it difficult to attribute study outcomes 
to Reading Recovery. 

18. The sample is not appropriate to this review: this study did not focus on students learning to read in English, one of the parameters for this WWC review. 

19. Does not use a strong causal design: this is a qualitative study. 

20. Complete data are not reported: the WWC could not evaluate the design because complete data were not reported. Attempts to contact the authors for 
more information were unsuccessful. 

21. High overall attrition: this study reported an attrition rate of greater than 20 percentage points. 

22. The sample is not appropriate to this review: the parameters for this WWC review specified student outcome measures, but this study did not focus on 
students. 

23. Disruption: this study, which used a quasi-experimental design, exhibited disruption problems that made it difficult to attribute study outcomes to the 
intervention, as delivered. 

24. Does not use a strong causal design: this study, which used a quasi-experimental design, tested only a portion of the Reading Recovery curriculum, 
making it difficult to attribute study outcomes to Reading Recovery. 
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25. Does not use a strong causal design: for the portion of the sample of interest for this WWC review, there was one of two issues. Either there was a 
confound, with the Reading Recovery intervention being modified or combined with other interventions, making it difficult to attribute study outcomes to 
the intervention. Or the study did not establish that the comparison group was comparable to the treatment group prior to the start of the intervention. 

26. The outcome measures are not relevant to this review: the outcomes in this study did not address one of the domains of interest in this review. 
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Appendix 



Appendix A1.1 Study characteristics: Baenen, Bernhole, Dulaney, & Banks, 1997 (randomized controlled trial) 



Characteristic 


Description 


Study citation 


Baenen, N., Bernhole, A., Dulaney, C., & Banks, K. (1997). Reading Recovery: Long-term progress after three cohorts. Journal of Education for Students Placed at Risk, 2(2), 161. 


Participants 


The study involved 772 first-grade students who were studied in four cohorts between 1990 and 1994. Of this total number, only one cohort (who received treatment in 
1990-91 at 10 schools 1 ) with 168 students met WWC criteria for inclusion. After 1990-91, the comparison group was made up of students who were not comparable to the 
intervention group in terms of their achievement level — they were not the lowest achieving students in participating schools, as the Reading Recover >® students were. Due to 
attrition, the author's final analysis sample included 147 first-grade students in the 1990-91 cohort. 2 All 147 students were followed longitudinally into second grade; 127 of 
these students were followed and included in the third-grade analysis. No information was reported regarding ethnicity or gender, but all students were low-achieving and a 
high percentage of students qualified for free or reduced-price lunch (Baenen, personal communication, January 9, 2006). 


Setting 


The study took place in 10 elementary schools in the Wake County public schools in North Carolina. 


Intervention 


The intervention group was originally composed of 84 students who qualified for Reading Recoverf on the basis of three subtests from the Observation Survey of Early Lit- 
eracy Achievement — Text Reading Level (running record), Dictation, and Writing Vocabulary. Intervention students, among the lowest achieving students at their schools, were 
randomly assigned to receive the Reading Recoverf 1 intervention. They were taught by one of 12 teachers. Results were presented for 72 students for first-and second-grade 
analyses. Of these, 27% of the students received a “partial program’’ instead of the full set of more than 60 lessons in Reading Recoverf. 


Comparison 


The comparison group was composed of 84 students who qualified for Reading Recoverf on the basis of three subtests from the Observation Survey of Early Literacy 
Achievement — Text Reading Level (running record), Dictation, and Writing Vocabulary. These students, among the lowest achieving students at their schools, were randomly 
assigned to a waitlist for Reading Recoverf intervention. They did not receive Reading Recoverf during the time of the study but received the regular services available to 
them. Results were presented for 75 students for first- and second-grade analyses and 68 students for third-grade analysis. 


Primary outcomes 
and measurement 


Grade retention was measured at the end of first and second grade, and the North Carolina End-of-Grade test in reading was used at the end of third grade. The authors 
measured referral to special education and Title 1 services, but the outcomes are not included in this report because these types of outcomes are not specified by the WWC 
Beginning Reading Protocol. The authors used three subtests of the Observation Survey of Early Literacy Achievement, but these measures are not included in the review 
because collection of these data did not focus on the full sample of students. The authors also used a measure to gauge teacher perception of student achievement, but it is 
not included in this report because the WWC focuses on direct student achievement measures only. (See Appendix A2.4 for more detailed descriptions of outcome measures.) 


Teacher training 


Reading Recoverf teachers participated in a graduate-level course that included 33 training sessions from September to June of the intervention year. The 1990-91 school 
year was the first year of implementing Reading Recoverf for the district; therefore it was the first year the teachers taught the intervention. 



1. One of the manuscripts related to this study reported 11 schools. 

2. Pretest comparability of the students was demonstrated for 146 students. It is reasonable to assume that the pretest mean for the 146 students would be similar to that based on 147 students. 
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Appendix A1.2 Study characteristics: Pinnell, DeFord, & Lyons, 1988 (randomized controlled trial) 



Characteristic 


Description 


Study citation 


Pinnell, G. S., DeFord, D. E., & Lyons, C. A. (1 988). Reading Recovery: Early intervention for at-risk first graders (Educational Research Service Monograph). Arlington, VA: 
Educational Research Service. 


Participants 


The study involved 187 first-grade students from 14 schools. While information about the specific schools included in the study was not presented, the district has a racial 
composition of 45% nonwhite students and a gender composition of 51% male students. The city had a mobility rate of approximately 20%, with 66% of students receiving 
free or reduced-price lunch. Students in the study were low achieving, as defined by scoring in the lowest 20% of their class on reading measures. Due to attrition, results 
were reported for 184 students. 


Setting 


The study took place in 14 urban public schools in Columbus, Ohio. 


Intervention 


There were two intervention groups with a total of 134 students taught by 32 teachers. One group was randomly assigned to receive the standard Reading Recoverf pull-out 
program (n = 38). These students had regular classroom teachers who were not trained in Reading Recoverf. This group experienced attrition of one student. A second 
group of students determined to be eligible for Reading Recover >® received the standard Reading Recover y® pull-out program, with the addition of having regular classroom 
teachers trained in Reading Recoverf (n = 96). The second group was not randomly assigned to Reading Recoverf 1 nor to their classroom teacher, so this portion of the 
study is considered a quasi-experimental design. It is not included in the intervention rating because the second intervention group with a Reading Recoverf 1 - trained teacher 
as regular classroom teacher goes beyond the standard implementation of the program. 1 


Comparison 


The comparison group was composed of low-achieving students who did not have regular classroom teachers trained in Reading Recoverf and who were randomly assigned 
to an alternative compensatory program (n = 53) (Pinnell, personal communication, September 9, 2006). Students in this group received from a trained paraprofessional a 
series of skill-oriented drill activities conducted in small groups or individual sessions of 30-45 minutes. The comparison group was similar to both intervention groups on 
pretest measures (Pinnell, personal communication, December 5, 2005). The final analysis sample included 51 students. 


Primary outcomes 
and measurement 


Five subtests of the Observation Survey of Early Literacy Achievement were included — Letter Identification, Word Recognition, Concepts about Print, Writing Vocabulary, 
and Dictation. Additional measures included a writing assessment, the Reading Vocabulary subtest of the Comprehensive Test of Basic Skills (CTBS), and the Reading Com- 
prehension subtest of the CTBS. Results from the Observation Survey: Text Reading Level subtest were not reported because effect sizes could not be calculated that were 
comparable to other measures. 2 (See Appendices A2.1-A2.4 for more detailed descriptions of outcome measures.) 


Teacher training 


Teachers received a full year of special training during which they practiced teaching and observed other teachers through a one-way mirror. Twelve teachers received training 
from a university program and were in their second year of teaching the intervention during the time of the study. The other 20 teachers received training from a local teacher 
leader and were in their first year of teaching the intervention during the time of the study. 



1. Results are reported in Appendices A4.1-4.3. 

2. Findings based on the Observation Survey: Text Reading Level subtest are not included in the effectiveness ratings because effect sizes and the statistical significance of the findings could not 
be calculated given the information provided in the study. The Observation Survey: Text Reading Level subtest is reported as reading levels based on ordinal, rather than equal-interval, scales. 
For example, the increase in fluency measured by scoring at level 3 compared with level 2 on the scale may not be equal to the increase in fluency as measured by scoring at level 24 compared 
with level 23. The author no longer had information on the number of students scoring at each level. For more detail, see Denton, C. A., Ciancio, D. J., & Fletcher, J. M. (2006). Validity, reliability, 
and utility of the Observation Survey of Early Literacy Achievement. Reading Research Quarterly , 47(1), 8-34. 
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Appendix A1.3 Study characteristics: Pinnell, Lyons, DeFord, Bryk, & Seltzer, 1994 (randomized controlled trial) 



Characteristic 


Description 


Study citation 


Pinnell, G. S., Lyons, C. A., DeFord, D. E., Bryk, A. S., & Seltzer, M. (1994). Comparing instructional models for the literacy education of high-risk first graders. Reading 
Research Quarterly, 29( 1), 8-39. 


Participants 


The study was designed to examine outcomes of 403 first-grade students distributed across 43 schools. Percentages of children receiving Aid to Dependent Children ranged 
between 9% and 42% among districts. For this report, the WWC looked at results for students in 10 districts at schools that were using Reading Recover f. In those schools, 
eligible students were randomly assigned to the Reading Recoverf group or the comparison group, which did not receive any special instruction. The WWC did not look at the 
portions of the study that examined results for students who received three additional reading interventions — Reading Success, Direct Instruction Skills Plan, and Reading and 
Writing Group. These were delivered at schools other than the Reading Recoverf schools and were compared with comparison students in those schools. In the original study 
design, 100 students were randomly assigned to receive Reading Recoverf 1 or the comparison condition at 10 schools. However, random assignment was not successfully 
implemented at two schools and there was minor attrition at the remaining schools, resulting in a final analytic sample of 79 students from eight schools. Treatment and 
comparison students attended the same schools. All students were low achieving. 


Setting 


The study took place in 10 school districts (two rural, two suburban, and six urban) in Ohio. 


Intervention 


The intervention group was composed of 31 low-achieving students from one school in each of the districts that already had a Reading Recoverf program in place. Interven- 
tion students participated in the Reading Recoverf curriculum. 


Comparison 


The comparison group included 48 students who were randomly assigned to receive no special instruction, but continued to participate in their regular reading program. 
These students attended the same schools as the intervention students. 


Primary outcomes 
and measurement 


The Dictation subtest of the Observation Survey of Early Literacy Achievement was administered in addition to the Woodcock Reading Mastery Test-Revised and the Gates- 
MacGinitie Reading Test. Results from the Observation Survey: Text Reading Level subtest were not reported because effect sizes that were comparable to other measures 
could not be calculated. 1 (See Appendices A2.2 and A2.4 for more detailed descriptions of outcome measures.) 


Teacher training 


At least two years prior to the study, Reading Recoverf teachers received specialized training — weekly 2.5 hour sessions for one year — during which they practiced teaching 
and observed other teachers through a one-way mirror. 



1. Findings based on the Observation Survey: Text Reading Level subtest are not included in the effectiveness ratings because effect sizes and the statistical significance of the findings could not 
be calculated given the information provided in the study. The Observation Survey: Text Reading Level subtest is reported as reading levels based on ordinal, rather than equal-interval, scales. 
For example, the increase in fluency measured by scoring at level 3 compared with level 2 on the scale may not be equal to the increase in fluency as measured by scoring at level 24 compared 
with level 23. The author no longer had information on the number of students scoring at each level. For more detail, see Denton, C. A., Ciancio, D. J., & Fletcher, J. M. (2006). Validity, reliability, 
and utility of the Observation Survey of Early Literacy Achievement. Reading Research Quarterly ; 47(1), 8-34. 
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Appendix A1.4 Study characteristics: Schwartz, 2005 (randomized controlled trial) 



Characteristic 


Description 


Study citation 


Schwartz, R. M. (2005). Literacy learning of at-risk first-grade students in the Reading Recovery early intervention. Journal of Educational Psychology, 97(2), 257-267. 


Participants 


The study was designed to examine the outcomes of first-grade students distributed across 47 Reading Recoverf teachers. Each teacher identified for random assignment 
two students eligible for Reading Recoverf based on their low scores on the Observation Survey of Early Literacy Achievement. These 94 students were randomly assigned to 
enter the Reading Recoverf program during either the first or second half of the school year. The study included two additional comparison groups of 47 low-average and 47 
high-average readers from the same classrooms as the Reading Recoverf students who were not expected to participate in the Reading Recoverf program. Analysis involv- 
ing these comparison groups was not eligible for WWC review because the WWC considers only comparisons of students with similar achievement backgrounds in assessing 
the effectiveness of Reading Recover >®. Because of missing testing data, the author's final analysis of the randomly assigned groups included 74 students distributed across 
37 teachers. 


Setting 


The study took place in elementary schools in 14 states. 


Intervention 


The intervention group was composed of the lowest 20%— 30% of students who qualified for Reading Recoverf based on the six subtests of the Observation Survey and on 
judgments by teachers. All teachers agreed to dedicate one of their four Reading Recoverf slots to the study, allowing random assignment of one of two eligible students to 
receive intervention delivery in the first half of the year. Thus students in the intervention group were randomly assigned to receive Reading Recoverf during the first half of 
the year. Originally, participants were taught by 47 Reading Recoverf teachers who had volunteered to be part of the study, but because of missing information, data from 
only 37 teachers and 37 students were included in the author’s final analysis. The intervention group was 61% male, 38% white, 47% black, 12% Hispanic, and 3% Asian. 
About 60% of the group received free or reduced-price lunch. 


Comparison 


The comparison group of interest to the WWC was composed of the lowest 20%— 30% of students who qualified for Reading Recoverf based on six subtests of the Observa- 
tion Survey and on judgments by teachers. Participants in the comparison group were randomly assigned to receive Reading Recoverf during the second half of the year. 
Thus these participants served as comparisons only during the first part of the year when they received instruction in their regular classroom only. Data from 37 teachers and 
37 students were included in the author’s final analysis. The comparison group was 41% male, 47% white, 38% black, and 15% Hispanic. Approximately 57% of the group 
received free or reduced-price lunch. 


Primary outcomes 
and measurement 


Six subtests of the Observation Survey of Early Literacy Achievement were administered — Letter Identification, Word Recognition, Concepts about Print, Writing Vocabulary, 
Dictation, and Text Reading Level. 1 Other assessments included the Yopp-Singer Phoneme Segmentation Task, the deletion task, the Slosson Oral Reading Test-Revised, and 
the Degrees of Reading Power Test. (See Appendices A2.1-A2.4 for more detailed descriptions of outcome measures.) 


Teacher training 


No information on training for the specific teachers participating in this study was provided, but Reading Recove rf teachers typically undergo graduate-level training and must 
implement the program under the supervision of a teacher leader. 



1. For many studies, findings based on the Observation Survey: Text Reading Level subtest were not included in the effectiveness ratings because effect sizes and the statistical significance of 
the findings could not be calculated given the information provided in the study. In this case, however, the author was able to provide the number of students scoring at each level (Schwartz, 
personal communication, December 5, 2006). The outcome was calculated as a logged-odds ratio, with the number of students reading at or above the first-grade level compared with students 
reading below the first-grade level. 
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Appendix A1.5 Study characteristics: Iverson & Tunmer, 1993 (quasi-experimental design) 



Characteristic 


Description 


Study citation 


Iverson, S., &Tunmer, W. E. (1993). Phonological processing skills and the Reading Recovery program. Journal of Educational Psychology, 85(1), 112-126. 


Participants 


The study included 160 first-grade students distributed across 47 classrooms in 30 schools. Ninety-six students who performed at the lowest levels on tests administered at 
the beginning of the first grade were placed into one of three matched groups: a standard Reading Recoverf group, a modified Reading Recoverf group, and a comparison 
group that did not participate in Reading Recoverf. Students across the three groups were matched based on pretest scores from the Letter Identification and Dictation sub- 
tests of the Observation Survey. The study also compared the Reading Recoverf groups with comparison groups consisting of 64 children from the same classrooms as the 
Reading Recover y® participants who were deemed by the classroom teachers to be performing at average reading level. But analyses involving these comparison groups were 
not eligible for this WWC review because the WWC considers only comparisons of students with similar achievement backgrounds to provide information on the effectiveness 
of Reading Recover f. 


Setting 


Thirty schools from 13 school districts in Rhode Island. 


Intervention 


The study authors used two intervention groups drawn from the 23 schools operating Reading Recover f programs. Sixty-four low-scoring students from 34 classrooms 
were assigned to receive standard or modified Reading Recoverf \ The 32 students in the standard Reading Recoverf group received one-on-one lessons that followed the 
procedures described by the program developer, which included a letter identification segment as needed. Once a child mastered letter identification, time allocated to this 
segment could be used for incidental word analysis in other segments. For the 32 students in the modified Reading Recoverf 1 program, explicit instruction in letter-phoneme 
patterns replaced the letter identification segment beginning in the fourth week of the program. The WWC did not include the second comparison group in its intervention 
ratings because it was a modified version of the standard program. 1 


Comparison 


The matched comparison group of interest to the WWC was drawn from the seven schools that did not operate Reading Recoverf programs. Thirty-two low-scoring students 
from 13 classrooms were matched to the intervention groups based on pretest scores. Students in the comparison group received the support services normally available to 
at-risk readers, which consisted of small-group instruction provided at least four days a week through either the federally funded Chapter One program or the state-supported 
literacy program. 1 


Primary outcomes 
and measurement 


Five subtests of the Observation Survey of Early Literacy Achievement were included — Letter Identification, Word Recognition, Concepts about Print, Writing Vocabulary, and 
Dictation. Additional measures included the Dolch Word Recognition Test, Yopp-Singer Phoneme Segmentation Test, a phoneme deletion task, and a pseudoword decoding 
task. Results from the Observation Survey: Text Reading Level subtest were not reported because effect sizes that were comparable to other measures could not be calcu- 
lated. 2 (See Appendices A2.1, A2.2, and A2.4 for more detailed descriptions of outcome measures.) 


Teacher training 


Reading Recoverf teachers in the study were reading specialists with master’s degrees. To learn how to administer the assessments, teachers participated in four two-hour 
training sessions. To learn how to deliver the intervention, teachers attended weekly two-hour classes. During the session, teachers were able to practice teaching and 
observe other teachers through a one-way mirror. They also received feedback from the teacher leader during individual visits to the classes. 



1. Results are reported in Appendices A4.1 and A4.3. 

2. Findings based on the Observation Survey: Text Reading Level subtest are not included in the effectiveness ratings because effect sizes and the statistical significance of the findings could not 
be calculated given the information provided in the study. The Observation Survey: Text Reading Level subtest is reported as reading levels based on ordinal, rather than equal-interval, scales. 
For example, the increase in fluency measured by scoring at level 3 compared with level 2 on the scale may not be equal to the increase in fluency as measured by scoring at level 24 compared 
with level 23. The author no longer had information on the number of students scoring at each level. For more detail, see Denton, C. A., Ciancio, D. J., & Fletcher, J. M. (2006). Validity, reliability, 
and utility of the Observation Survey of Early Literacy Achievement. Reading Research Quarterly , 47(1), 8-34. 
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Appendix A2.1 Outcome measures in the alphabetics domain by construct 



Outcome measure 


Description 


Phonemic awareness 
Deletion task 


A 10-item version of the Rosner task that requires students to repeat a word and then say it again after omitting a given syllable or sound. The assessment is not standardized 
(as cited in Schwartz, 2005). 


Phoneme deletion task 


Developed by Calfee, this measure requires students to delete the first consonant segment in a word and read the word aloud. It includes four training lists and six transfer 
lists that increase in difficulty as students move through each list. The most difficult lists consist of both real words and pseudowords. Student scores are based on the 
number of correct words read from the transfer lists (as cited in Iverson and Tunmer, 1993). 


Yopp-Singer Phoneme 
Segmentation Test 

Print awareness 


Developed by Yopp, the test is an orally administered assessment. A teacher works with each student individually and introduces the test as a word game. The teacher has a 
list of 22 words that the student is not allowed to see. After the teacher reads each word, the student must repeat all of the sounds in the word separately and slowly (as cited 
in Iverson and Tunmer, 1993; Schwartz, 2005). 


Observation Survey 
of Early Literacy 
Achievement: Concepts 
about Print subtest 1 

Letter knowledge 


Students perform tasks related to printed language concepts (for example, directionality and word concepts) while reading a book. This assessment, developed by Clay, is not 
standardized (as cited in Iverson and Tunmer, 1993; Pinnell, DeFord, and Lyons, 1988; Schwartz, 2005). 


Observation Survey of Early 
Literacy Achievement: Letter 
Identification subtest 1 

Phonics 


Students identify upper- and lowercase letters. This assessment, developed by Clay, is not standardized (as cited in Iverson and Tunmer, 1993; Pinnell, DeFord, and Lyons, 
1988; Schwartz, 2005). 


Dolch Word Recognition Test 


A standardized, individually administered word recognition test that contains a list of 220 short, frequently occurring words. Students are asked to read aloud from the list of 
words to measure their automatic word recognition and word retrieval skills, not their ability to sound out words (as cited in Iverson and Tunmer, 1993). 


Observation Survey of Early 
Literacy Achievement: 

Word Recognition subtest 
(also known as Ready to 
Read or Ohio Word Test ) 1 


Students read 20 common sight words from basic reading texts, and their accuracy is scored. This assessment, developed by Clay, is not standardized (as cited in Iverson and 
Tunmer, 1993; Pinnell, DeFord, and Lyons, 1988; Schwartz, 2005). 


Pseudoword decoding task 


A 40-item test that requires students to name single-syllable synthetic words that conform to the rule of English orthography. The assessment is not standardized (as cited in 
Iverson and Tunmer, 1993). 



1. The Clay Observation Survey was developed by Dr. Marie Clay, who also developed Reading Recovery ®. Although there is no evidence of obvious overalignment between the measure and the 
intervention (intervention student receiving exposure to the measure during the course of treatment), it should be noted that the developer of the intervention and the measure were the same. 
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Appendix A2.2 Outcome measures in the fluency domain 



Outcome measure 


Description 


Observation Survey of 
Early Literacy Achievement: 
Text Reading Level subtest 1 ’ 2 


The percentage of students scoring at the first-grade reading level or higher compared with those scoring lower than first grade. To determine this, students read from pas- 
sages of increasing difficulty, and student’s error rate and self-correcting behavior were recorded using the running record technique. Students read from leveled texts drawn 
from a basal reading series until their accuracy rate fell below 90%. Results were translated to a numerical reading level from level 1 to level 30, which in turn match up to 
grade-level equivalency. This assessment method is not standardized (as cited in Schwartz, 2005). 


Slosson Oral Reading 
Test-Revised (S0RT-R3) 


Developed by Nicholson, the measure consists of 200 words arranged in order of difficulty, with 20 words per list. Each list represents an approximate reading grade level 
(for example, list one is equivalent to first grade). Administration ends after all the words on one list are missed. The measure is standardized and norm-referenced (as cited in 
Schwartz, 2005). 



1. The Clay Observation Survey was developed by Dr. Marie Clay, who also developed Reading Recovery®. Although there is no evidence of obvious overalignment between the measure and the 
intervention (intervention student receiving exposure to the measure during the course of treatment), it should be noted that the developer of the intervention and the measure were the same. 

2. Findings based on the Observation Survey: Text Reading Level subtest are not included in the effectiveness ratings for most studies because effect sizes and the statistical significance of the 
findings could not be calculated given the information provided in the studies. The author of one study (Schwartz, 2005) was able to provide the percentage of students scoring at each level. The 
outcome was changed to a logged-odds ratio for students scoring at the first-grade level or higher compared with those scoring lower than first grade. 

Appendix A2.3 Outcome measures in the comprehension domain by construct 



Outcome measure 


Description 


Reading comprehension 

Comprehensive Test of 
Basic Skills (CTBS): Reading 
Comprehension subtest 


A group-administered, standardized assessment of reading comprehension (as cited in Pinnell, DeFord, and Lyons, 1988). 


Degrees of Reading 
Power Test 

Vocabulary development 


An untimed standardized test requiring students to read a nonfiction passage with a word or set of words missing. Students select an appropriate answer to complete the 
sentence from a set of four or five alternatives (as cited in Schwartz, 2005). 


CTBS: Reading 
Vocabulary subtest 


A group-administered, standardized assessment of vocabulary (as cited in Pinnell, DeFord, and Lyons, 1988). 
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Appendix A2.4 Outcome measures in the general reading achievement domain 



Outcome measure 


Description 


Gates-MacGinitie (1978) 


A standardized test that covers vocabulary and comprehension aspects of reading. It evaluates students’ abilities to decode initial consonants, consonant clusters, final conso- 
nants, and vowels in real English words and also measures their ability to recognize commonly used words without decoding. For reading comprehension, answer choices are 
given as pictures and words (as cited in Pinnell et al., 1994). 


North Carolina End-of- 
Grade reading test 


A standardized state assessment designed to match the North Carolina curriculum. It uses multiple-choice questions with reading passages and is designed to measure read- 
ing and thinking skills (as cited in Baenen et al., 1997). 


Observation Survey of Early 
Literacy Achievement: 
Hearing and Recording 
Sounds in Words 
(Dictation) subtest 1 


Students write the words that are dictated to them in sentence form. This assessment, developed by Clay, is not standardized (as cited in Iverson and Tunmer, 1993; Pinnell, 
DeFord, and Lyons, 1988; Pinnell et al., 1994; Schwartz, 2005). 


Observation Survey of Early 
Literacy Achievement: 
Writing Vocabulary subtest 1 


Students are given 10 minutes to write as many words as they can on a blank sheet of paper. If needed, a standard set of prompts is used to encourage additional attempts to 
write. The measure is scored by counting the number of correctly spelled words (as cited in Iverson and Tunmer, 1993; Pinnell, DeFord, and Lyons, 1988; Schwartz, 2005). 


Retention 


The percentage of students who were retained in their current grade (as cited in Baenen et al., 1997). 


Woodcock Reading 
Mastery Test-Revised 


A standardized test composed of six subtests. The subtests measure ability to form associations between visual stimuli and oral responses; ability to recognize upper- and 
lowercase letters in a variety of fonts; ability to read words aloud; ability to read aloud nonsense words or uncommon words to test phonic and structural analysis skills for 
pronouncing unfamiliar words; vocabulary ability through the use of antonyms, synonyms, and analogies; and passage comprehension by filling in missing words in a short 
paragraph (as cited in Pinnell et al., 1994). 



1. The Clay Observation Survey was developed by Dr. Marie Clay, who also developed Reading Recovery®. Although there is no evidence of obvious overalignment between the measure and the 
intervention (intervention student receiving exposure to the measure during the course of treatment), it should be noted that the developer of the intervention and the measure were the same. 
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Appendix A3.1 Summary of findings for all domains 1 



Domain 




Alphabetics 






Comprehension 




Phonemic 








Reading Vocabulary 


General reading 


awareness 


Print awareness Letter knowledge 


Phonics 


Fluency 


comprehension development 


achievement 



Met evidence standards 

Baenen etal., 1997 


nr 


nr 




nr 


nr 


nr 


nr 


nr 


ind 


Pinnell, DeFord, & Lyons, 1988 


nr 


+ 




ind 


(+) 


nr 


+ + 


+ 


Pinnell et al., 1994 


nr 


nr 




nr 


nr 


nr 


nr 


nr 


+ 


Schwartz, 2005 


(+) 


+ 




(+) 


+ 


+ 


ind 


nr 


+ 


Met evidence standards with reservations 

Iverson &Tunmer, 1993 + 


+ 




+ 


+ 


nr 


nr 


nr 


+ 


Rating of effectiveness 






Positive 






Potentially positive 


Potentially positive 


Positive 



nr = no reported outcomes under this domain or construct 
+ = study average finding was positive and statistically significant 

(+) = study average finding was positive and substantively important, but not statistically significant 

ind = study average finding was indeterminate, that is, neither substantively important nor statistically significant 

1 . This appendix reports summary findings of study averages that were considered for the effectiveness rating and the improvement index in each domain. More detailed information on findings for the all measures within the domains and 
the constructs that factor into the domains are reported in Appendices A3.2-A3.5. 
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Appendix A3.2 Summary of study findings included in the alphabetics domain by construct 1 









Author’s findings from the studies 














Mean outcome 
(standard deviation 2 ) 




WWC calculations 




Outcome measure 


Study 

sample 


Sample size 


Reading 

Recovery ® (RR) Comparison 

group group 


Mean 

difference 3 (RR- 
comparison) 


Statistical 
significance 5 
Effect size 4 (at a = 0.05) 


Improvement 

index 6 



Phonemic awareness construct 



Schwartz, 2005 (randomized controlled trial) 7 



Deletion task 8 


Grade 1 


74 students 


6.64 

(2.56) 


5.58 

(2.50) 


1.06 


0.41 


ns 


+16 


Yopp-Singer Phoneme 
Segmentation Test 8 


Grade 1 


74 students 


17.70 

(4.93) 


15.27 

(5.43) 


2.43 


0.46 


ns 


+18 




Iverson & Tunmer, 1993 (quasi-experimental design) — standard /?/? group vs. comparison group 7 






Phoneme deletion task 8 


Grade 1 


64 students 


14.00 


3.91 


10.09 


1.73 


Statistically 


+46 






14 schools 


(6.16) 


(5.31) 






significant 




Yopp-Singer Phoneme 


Grade 1 


64 students 


17.63 


6.21 


11.42 


2.33 


Statistically 


+49 


Segmentation Test 8 




14 schools 


(4.46) 


(5.18) 






significant 




Print awareness construct 




















Pinnell, DeFord, & Lyons, 1988 (randomized controlled trial)- 


— /?/? group without RR teachers vs. comparison group 7 




Observation Survey: Concepts 


Grade 1 


88 students 


16.80 


13.98 


2.82 


0.89 


Statistically 


+31 


about Print subtest 9 






(2.91) 


(3.31) 






significant 








Schwartz, 2005 (randomized controlled trial) 7 








Observation Survey: Concepts 


Grade 1 


74 students 


19.24 


16.68 


2.56 


1.04 


Statistically 


+35 


about Print subtest 9 ' 10 






(2.55) 


(2.30) 






significant 






Iverson & Tunmer, 1993 (quasi-experimental design) — standard /?/? group vs. comparison group 7 






Observation Survey: Concepts 


Grade 1 


64 students 


18.12 


13.72 


4.40 


1.58 


Statistically 


+44 


about Print subtest 9 




14 schools 


(1.74) 


(3.49) 






significant 





Letter knowledge construct 



Observation Survey: Letter 
Identification subtest 9 



Pinnell, DeFord, & Lyons, 1988 (randomized controlled trial) — /?/? group without RR teachers vs. comparison group 7 

Grade 1 88 students 48.05 49.61 -1.56 -0.24 ns 



48.05 

(1.41) 



49.61 

(8.33) 



-10 



(continued) 
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Appendix A3.2 Summary of study findings included in the alphabetics domain by construct (continued) 









Author’s findings from the studies 














Mean outcome 
(standard deviation 2 ) 




WWC calculations 




Outcome measure 


Study 

sample 


Sample size 


Reading 

Recovery ® (RR) Comparison 

group group 


Mean 

difference 3 (RR- 
comparison) 


Statistical 
significance 5 
Effect size 4 (at a = 0.05) 


Improvement 

index 6 



Observation Survey: Letter 
Identification subtest 9 ’ 10 


Grade 1 


74 students 


Schwartz, 2005 (randomized controlled trial) 7 

52.28 51.68 0.60 

(1.27) (2.78) 


0.23 


ns 


+9 




Iverson & Tunmer, 1993 (quasi-experimental design) — standard /?/? group vs. comparison group 7 






Observation Survey: Letter 


Grade 1 


64 students 


53.53 


49.00 


4.53 


0.94 


Statistically 


+33 


Identification subtest 9 




14 schools 


(1.01) 


(6.63) 






significant 




Phonics construct 




















Pinned, DeFord, & Lyons, 1988 (randomized controlled trial portion) — /?/? group without RR teachers vs. comparison group 




Observation Survey: Word 


Grade 1 


88 students 


13.29 


11.98 


1.31 


0.41 


ns 


+16 


Recognition subtest 9 






(1.63) 


(3.92) 














Schwartz, 2005 (randomized controlled trial) 7 








Observation Survey: Word 


Grade 1 


74 students 


14.96 


8.87 


6.09 


1.37 


Statistically 


+41 


Recognition subtest 9 ’ 10 






(3.99) 


(4.75) 






significant 






Iverson & Tunmer, 1993 (quasi-experimental design) — standard /?/? group vs. comparison group 7 






Dolch Word Recognition Test 9 


Grade 1 


64 students 


94.87 


20.22 


74.65 


3.24 


Statistically 


+50 






14 schools 


(29.07) 


(13.93) 






significant 




Observation Survey: Word 


Grade 1 


64 students 


12.66 


5.56 


7.10 


2.33 


Statistically 


+49 


Recognition subtest 9 




14 schools 


(1.85) 


(3.83) 






significant 




Pseudoword decoding task 8 


Grade 1 


64 students 


9.00 


1.41 


7.59 


1.43 


Statistically 


+42 






14 schools 


(7.07) 


(2.31) 






significant 




Averages 


















Average 11 for alphabetics (Pinned, DeFord, & Lyons, 1988) 








0.35 


ns 


+14 


Average 11 for alphabetics (Schwartz, 2005) 










0.70 


Statistically 


+26 
















significant 




Average 11 for alphabetics (Iverson & Tunmer, 1993) 








1.94 


Statistically 


+47 
















significant 





(continued) 
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